# Zero-shot classification

The Teacher V 2
This is a transformers model for zero-shot classification tasks, which can classify text without a large amount of labeled data.
Text Classification Transformers
T
shiviklabs
172
0
Clip Vitb16 Test Time Registers
A vision-language model based on the OpenCLIP-ViT-B-16 architecture. By introducing test-time registers to optimize the internal representation, it solves the problem of feature map artifacts.
Text-to-Image Transformers
C
amildravid4292
517
0
Industry Project V2
Apache-2.0
An instruction fine-tuned model optimized based on the Mistral architecture, suitable for zero-shot classification tasks
Large Language Model
I
omsh97
58
0
Fg Clip Large
Apache-2.0
FG-CLIP is a fine-grained vision and text alignment model that achieves global and region-level image-text alignment through two-stage training, enhancing fine-grained visual understanding ability.
Multimodal Alignment Transformers English
F
qihoo360
538
3
Smartshot Zeroshot Finetuned V0.2.0
A zero-shot classification model fine-tuned from MoritzLaurer/deberta-v3-base-zeroshot-v2.0-c using the SmartShot method with an NLI framework
Text Classification English
S
gincioks
197
0
Smartshot Zeroshot Finetuned V0.1.2
MIT
A zero-shot classification model fine-tuned based on roberta-base-zeroshot-v2.0-c, enhanced with SmartShot method and synthetic data
Text Classification Other
S
gincioks
119
0
Marfin Emotion
Apache-2.0
An emotion detection model fine-tuned based on mDeBERTa-v3, supporting emotion classification in Indonesian and English
Text Classification Transformers Supports Multiple Languages
M
MarfinF
185
0
Modernbert Large Nli
Apache-2.0
A natural language inference model optimized through multi-task fine-tuning based on the ModernBERT-large model, excelling in zero-shot classification and NLI tasks.
Large Language Model Transformers Supports Multiple Languages
M
tasksource
61.52k
5
Modernbert Base Zeroshot V2.0
Apache-2.0
A zero-shot classifier fine-tuned based on ModernBERT-base, efficient and fast with low memory usage, suitable for various text classification tasks.
Text Classification Transformers
M
MoritzLaurer
261
17
Modernbert Large Zeroshot V2.0
Apache-2.0
A zero-shot classifier fine-tuned based on ModernBERT-large, efficient and fast with low memory usage, suitable for various text classification tasks.
Large Language Model Transformers
M
MoritzLaurer
25.66k
47
Llm Jp Clip Vit Large Patch14
Apache-2.0
A Japanese CLIP model trained based on the OpenCLIP framework, trained on a dataset of 1.45 billion Japanese image-text pairs, supporting zero-shot image classification and image-text retrieval tasks
Text-to-Image Japanese
L
llm-jp
254
1
Resnet50x64 Clip Gap.openai
Apache-2.0
CLIP model image encoder based on ResNet50 architecture with 64x width expansion, using Global Average Pooling (GAP) strategy
Image Classification Transformers
R
timm
107
0
Resnet50x16 Clip Gap.openai
Apache-2.0
A ResNet50x16 variant model based on the CLIP framework, focused on image feature extraction
Image Classification Transformers
R
timm
129
0
Resnet50x4 Clip Gap.openai
Apache-2.0
ResNet50x4 variant model based on the CLIP framework, designed for image feature extraction
Image Classification Transformers
R
timm
170
0
Resnet50 Clip Gap.openai
Apache-2.0
A ResNet50 variant based on the visual encoder part of the CLIP model, extracting image features through Global Average Pooling (GAP)
Image Classification Transformers
R
timm
250
1
Resnet50 Clip Gap.cc12m
Apache-2.0
CLIP-style image encoder based on ResNet50 architecture, trained on CC12M dataset, extracting features through Global Average Pooling (GAP)
Image Classification Transformers
R
timm
19
0
Vit Large Patch14 Clip 224.dfn2b
Other
A vision transformer model based on the CLIP architecture, focused on image feature extraction, released by Apple.
Image Classification Transformers
V
timm
178
0
Modernbert Large Zeroshot V1
MIT
A natural language inference model fine-tuned based on ModernBERT-large, specifically designed for zero-shot classification tasks
Text Classification Transformers English
M
r-f
54
2
Vit Huge Patch14 Clip 224.laion2b
Apache-2.0
ViT-Huge visual encoder based on the CLIP framework, trained on the laion2B dataset, supports image feature extraction
Image Classification Transformers
V
timm
1,969
0
Vit Base Patch32 Clip 224.laion2b
Apache-2.0
Vision Transformer model based on CLIP architecture, designed for image feature extraction, trained on the laion2B dataset
Image Classification Transformers
V
timm
83
0
Convnext Base.clip Laiona
Apache-2.0
ConvNeXt Base model based on the CLIP framework, trained on the LAION-Aesthetic dataset, suitable for image feature extraction tasks.
Image Classification Transformers
C
timm
14
0
Modernbert Base Nli
Apache-2.0
ModernBERT is a model fine-tuned on multi-task natural language inference (NLI) tasks, excelling in zero-shot classification and long-context reasoning.
Large Language Model Transformers Supports Multiple Languages
M
tasksource
1,867
20
Llm Jp Clip Vit Base Patch16
Apache-2.0
Japanese CLIP model trained on OpenCLIP framework, supporting zero-shot image classification tasks
Text-to-Image Japanese
L
llm-jp
40
1
Deberta Zero Shot Classification
MIT
A zero-shot text classification model fine-tuned on DeBERTa-v3-base, suitable for scenarios with scarce labeled data or rapid prototyping.
Text Classification Transformers English
D
syedkhalid076
51
0
LLM2CLIP Openai L 14 224
Apache-2.0
LLM2CLIP is an innovative approach that leverages large language models (LLMs) to unlock the potential of CLIP. It enhances text discriminability through a contrastive learning framework, breaking the limitations of the original CLIP text encoder.
Text-to-Image
L
microsoft
108
5
LLM2CLIP Openai B 16
Apache-2.0
LLM2CLIP is an innovative method that leverages large language models (LLMs) to extend CLIP's capabilities, enhancing text discriminability through a contrastive learning framework and significantly improving cross-modal task performance.
Text-to-Image Safetensors
L
microsoft
1,154
18
Bart Large Mnli Openvino
MIT
This is the OpenVINO optimized version of the facebook/bart-large-mnli model for zero-shot text classification tasks.
Text Classification
B
Smashyalts
16
0
Vit Base Patch16 Clip 224.metaclip 2pt5b
A dual-framework compatible vision model trained on the MetaCLIP-2.5B dataset, supporting both OpenCLIP and timm frameworks
Image Classification
V
timm
889
1
Resnet50 Clip.yfcc15m
MIT
ResNet50 model trained on the YFCC-15M dataset, compatible with both open_clip and timm frameworks, supporting zero-shot image classification tasks.
Image Classification
R
timm
631
0
Deberta Small Long Nli
A compact zero-shot classification model based on DeBERTa architecture, optimized for long-text natural language inference tasks, converted to ONNX format for web compatibility
Text Classification Transformers
D
onnx-community
19
1
EXLMR
Apache-2.0
EXLMR is an extended version of XLM-R that supports new languages by expanding the tokenizer vocabulary to mitigate out-of-vocabulary issues, specifically optimized for low-resource Ethiopian languages.
Large Language Model Transformers Other
E
Hailay
27
0
Marqo Fashionsiglip
Apache-2.0
Marqo-FashionSigLIP is a multimodal embedding model optimized for fashion product search, with a 57% improvement in MRR and recall rate compared to FashionCLIP.
Text-to-Image Transformers English
M
Marqo
493.25k
44
Deberta V3 Nli Onnx Quantized
Apache-2.0
Quantized ONNX model based on DeBERTa-v3-base, suitable for zero-shot text classification tasks
Text Classification Transformers English
D
pitangent-ds
25
0
Safeclip Vit H 14
Safe-CLIP is an enhanced vision-language model designed to mitigate risks associated with Not Safe For Work (NSFW) content in AI applications.
Text-to-Image Transformers
S
aimagelab
30
0
Gliclass Large V1.0
Apache-2.0
An efficient zero-shot classifier trained on synthetic data, suitable for topic classification, sentiment analysis, and reranking tasks in RAG workflows.
Text Classification Transformers English
G
knowledgator
80
5
Gliclass Base V1.0
Apache-2.0
GLiClass is an efficient zero-shot classifier inspired by GLiNER, suitable for text classification, sentiment analysis, and reranking tasks in RAG workflows.
Text Classification Transformers English
G
knowledgator
152
3
Gliclass Small V1.0
Apache-2.0
An efficient zero-shot classifier trained on synthetic data, suitable for topic classification, sentiment analysis, and reranking tasks in RAG workflows
Text Classification Transformers English
G
knowledgator
416
2
Gliclass Base V1.0 Lw
Apache-2.0
GLiClass is an efficient zero-shot classifier trained on synthetic data, suitable for text classification, sentiment analysis, and reranking tasks in RAG workflows.
Text Classification Transformers English
G
knowledgator
57
2
Biomed Right
A zero-shot classification model based on the Transformers library, capable of performing classification tasks without task-specific training data.
Text Classification Transformers
B
gritli
15
0
Deberta Base Long Nli
Apache-2.0
Based on the DeBERTa-v3-base model, the context length is extended to 1280, and fine-tuned for 250,000 steps on the tasksource dataset, focusing on natural language inference and zero-shot classification tasks.
Large Language Model Transformers
D
tasksource
541
23
Featured Recommended AI Models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase